Mixture autoregressive hidden Markov models for speech signals
Authors
Abstract
In this paper, a signal modeling technique based upon finite mixture autoregressive probabilistic functions of Markov chains is developed and applied to the problem of speech recognition, particularly speaker-independent recognition of isolated digits. Two types of mixture probability densities are investigated: finite mixtures of Gaussian autoregressive densities (GAM) and nearest-neighbor partitioned finite mixtures of Gaussian autoregressive densities (PGAM). In the former (GAM), the observation density in each Markov state is simply a (stochastically constrained) weighted sum of Gaussian autoregressive densities, while in the latter (PGAM) it involves nearest-neighbor decoding which, in effect, defines a set of partitions on the observation space. In this paper we discuss the signal modeling methodology and give experimental results on speaker-independent recognition of isolated digits. We also discuss the potential use of the modeling technique for other applications.
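The contrast between the two density types can be illustrated with a minimal sketch. It is not the paper's formulation: the conditional Gaussian form of the autoregressive density, the choice of "nearest neighbor" as the component with the highest log-density, and all names (`ar_log_density`, `gam_log_density`, `pgam_decode`) are assumptions made for illustration only.

```python
import math

def ar_log_density(frame, coeffs, noise_var):
    """Conditional Gaussian AR log-density of one frame (assumed form).

    The residual is e[t] = x[t] - sum_i coeffs[i] * x[t-1-i]; the first
    len(coeffs) samples are used only as history.
    """
    p = len(coeffs)
    log_p = 0.0
    for t in range(p, len(frame)):
        pred = sum(coeffs[i] * frame[t - 1 - i] for i in range(p))
        resid = frame[t] - pred
        log_p += -0.5 * (math.log(2 * math.pi * noise_var)
                         + resid * resid / noise_var)
    return log_p

def gam_log_density(frame, weights, components):
    """GAM-style state density: log of a weighted sum of AR densities.

    weights are assumed strictly positive and summing to 1 (the
    stochastic constraint); components is a list of (coeffs, noise_var).
    """
    terms = [math.log(w) + ar_log_density(frame, a, v)
             for w, (a, v) in zip(weights, components)]
    m = max(terms)  # log-sum-exp for numerical stability
    return m + math.log(sum(math.exp(t - m) for t in terms))

def pgam_decode(frame, components):
    """PGAM-style decoding: assign the frame to a single component.

    Assumption: the nearest neighbor is the component with the highest
    log-density, so each component owns one cell of the observation space.
    Returns (component index, that component's log-density).
    """
    scores = [ar_log_density(frame, a, v) for a, v in components]
    k = max(range(len(scores)), key=scores.__getitem__)
    return k, scores[k]

# Hypothetical data: a frame that decays by a factor of 0.9 per sample,
# scored against two first-order AR components.
frame = [1.0, 0.9, 0.81, 0.729, 0.6561]
components = [([0.9], 1.0), ([-0.5], 1.0)]
mixture_score = gam_log_density(frame, [0.5, 0.5], components)
cell, cell_score = pgam_decode(frame, components)
```

In this sketch the GAM score blends every component, whereas the PGAM step commits each frame to one component, which is what induces a partition of the observation space.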
Related resources
Nonlinear mixture autoregressive hidden Markov models for speech recognition
Gaussian mixture models are a very successful method for modeling the output distribution of a state in a hidden Markov model (HMM). However, this approach is limited by the assumption that the dynamics of speech features are linear and can be modeled with static features and their derivatives. In this paper, a nonlinear mixture autoregressive model is used to model state output distributions (...
On the application of hidden Markov models for enhancing noisy speech
We propose a new algorithm for enhancing noisy speech that has been degraded by statistically independent additive noise. The algorithm is based upon modeling the clean speech as a hidden Markov process with mixtures of Gaussian autoregressive (AR) output processes, and the noise process as a sequence of stationary, statistically independent, Gaussian AR vectors. The parameter sets of the mod...
Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words
On nonstationary hidden Markov modeling of speech signals
We propose an exact maximum likelihood (ML) approach for hidden Markov modeling of speech signals using models with mixtures of Gaussian autoregressive (AR) output probability distributions. This approach differs from the commonly used approach in two aspects. First, the parameters of the AR models are calculated using the exact, rather than the asymptotic, form of the likelihood function. Seco...
Journal title:
- IEEE Trans. Acoustics, Speech, and Signal Processing
Volume 33, Issue
Pages -
Publication date 1985